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ABSTRACT. 

Let X\ 1 A2, ... be a sequence of independent copies (s.i.c) of a real random variable 
(r.v.) X > 1, with distribution function df F(x) = ¥(X < x) and let X\ >n < Xi^ n < 
■■■ < X n ^ n be the order statistics based on the n > 1 first of these observations. 
The following continuous generalized Hill process 

j=k 

T n (r) = k~ T ^2j T (logX„_ j+ i in - logX n _ jtn ) , 
i=i 

r > 0, 1 < k < n, has been introduced as a continuous family of estimators of the 
extreme value index, and largely studied for statistical purposes with asymptotic 
normality results restricted to r > 1/2. We extend those results to < r < 1/2 and 
show that asymptotic normality is still valid for r = 1/2. For 0<t<1/2, we get 
non Gaussian asymptotic laws which are closely related to the Riemann function 
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1. Introduction 

The aim of this note is to settle an asymptotic theory for some func- 
tional forms of Hill's estimators. Precisely, let X\, X2, ... be a sequence 
of independent copies (s.i.c) of a real random variable (r.v.) X with 
distribution function (df) F(x) = F(X < 1), 1 e 1. Since we are only 
concerned with the upper tail of F in this paper, we assume without 
loss of generality that X > 1 and define a s.i.c. of the r.v. Y = logX 
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denoted Yx,Y 2 ,... with df G(x) = P(Y < x) = F(e x ), x > 0. Fi- 
nally Yi tn = logXi „ < ... < Y ntn = logX nn are their respective order 
statistics. The following generalized continuous Hill process 

j=k 

(1.1) T n (r) = k~ T ^2j T (}ogX n ^ j+hn - \ogX n _ jtn ) , 

i=i 

has been introduced and studied in [H] and [5], for a continuous param- 
eter t > 0, and as throughout this paper, 

1 < k = k(n) < n, k — > +oo, k/n — > as n — > +oo. 



But some margins of (1.1) appeared before that, since T n (l) is the Hill 
estimator introduced in [8] in 1975. Also, De Haan and Resnick [1] 
proposed DR n = T n (0)/(log/c) as a simple estimator of the extreme 
value index. It has been shown in pU] that for F e D(A), DR n - when 
appropriately centered and normalized - converges in law to a multiple 
of a Gumbel law. 



However, we are only concerned in this paper with values r > 0. In this 



case, the process ( 1.1 ) provides a family of estimators of the Univariate 
Extreme Value Index as shown in j5] in a sense to be defined shortly, 
after a brief recall on Extreme Value Theory. 

Since this work arises in the just mentioned theory, it would be appro- 
priate to make some reminders before going any further. The reader 
is referred to de Haan [3], de Haan and Ferreira [2], Resnick [12] and 
Galambos [7] for a modern and large account of the Extreme Value The- 
ory. However, the least to say is that the df F is said to be attracted 
to a non degenerated df M iff the maximum X n ^ n = max (Xx, ■■■X n ), 
when appropriately centered and normalized by two sequences of real 
numbers (a n > 0) n>0 and {b n ) n>0 , converges to M, in the sense that 



(1.2) lim P (X n>n <a n x + b n )= lim F n (a n x + b n ) = M{x), 



for continuity points x of M. If (1.2) holds, it is said that F is at 



tracted to M or F belongs to the domain of attraction of M, written 
F G D(M). It is well-kwown that the three nondegenerate possible 



limits in (1.2), called extremal df's, are the following. 



The Gumbel df 

(1.3) A(x) = exp(-exp(-x)), x 6 R, 
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the Frechet df with parameter 7 > 



(1.4) 7 (a;) = exp(-s 7 )I[ 0i+oo [(x), i6l 
and the WeibluU df with parameter /3 > 

(1.5) ij)p{x) = exp(-(a;) /3 )I]_ 00i o](a;) + (1 - l hoofi] (x)), xeR, 
where Ia denotes the indicator function of the set A. 

Actually the limiting df M is defined by an equivalence class of the 
binary relation TZ on the set D of df's on R, defined as follows 

V(Mi,M 2 ) G V 2 ,(M 1 TZ M 2 ) 3(a,b) G M+\{0} x R,V(x G E) : 

M 2 (x) =Mi(aar + 6). 
One easily checks that if F n (a n x + b n ) — > Mi(x), then F n [c n x + d n ) — >■ 
Mx (ax + 6) = M 2 (a;) whenever 

(1.6) ct n /d n — >■ a and (6„ — d n )/c n — > b as n — > 00. 

Theses facts allow to parameterize the class of extremal ay's. For this 



purpose, suppose that (1.2) holds for the three d/'s given in (1.3), (1.4) 



and (1.5). If we take sequences (c n > 0) n >i and (d n ) n >i such that the 



limits in (1.6) are a = 7 = 1/a and b = 1 (in the case of Frechet 
extremal domain), and a = —f3 = —1/a and b = —1 (in the case of 
Weibull extremal domain), and finally, if we interpret (1 + 7a;) -1 / 7 as 
exp(-x) for 7 = (in the case of Gumbel extremal domain), we are 
entitled to write the three extremal aj's in the parameterized shape 

(1.7) G 7 (x) = exp(-(l + 7^r 1/7 ), 1 + ix > 0, 

called the Generalized Extreme Value (GEV) df with parameter 7 G R. 

The parameter 7 is called extreme value index. Originally, Hill(1975) 
[S] introduced the so-called Hill estimator T n (l) of the parameter 7 of 
the Pareto df {l — Cx~ 1//7 } I( x >o)- So did Pickands [llj as well. In 
the same way, the Generalized Hill process T n (r) provides an infinite 
family of estimators for the extremal index value in the sense that for 
any r > 0, for Fg D{G 1 ), < 7 < +00, 

rT n (r) — > 7 a.s as n — >■ +00. 

This motivated Diop and Lo [6] to find out asymptotic normality results 
for this family in order to construct statistical tests. Although their 
results seem to be valuable, they restricted themselves to values r > 
1/2. Actually, this restriction is imposed by the so-called Hungarian 
approximation methods based on M. Csorgo et al. |I] results that they 
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used. It is then necessary to call for another approach to get around 
this obstacle. 

In this paper, we will complete the work of [6] by giving the asymptotic 
law of T„,(r) for < r < 1/2 under the hypothesis F G D(G^), < 
7 < +oo. Our results are also complements to the related ones for 
T n (0) in connection with those in [IT] . [I] and [10]. We point out for 
once that D(Gq) = D(T) for 7 = +00. 



We restrict ourselves ourselves here to the cases < 7 < +00, since 
the case 7 < 0, may be studied through the transform F(x (F) — 
1/')) G -D(C7_ 7 ) for estimating 7. This leads to replace X n _ j +1)T l by 
xo{F) — 1/1/Xj^n in (1.1). However, a direct investigation of (1.1) for 
7 < is possible. This requires the theory of sums of dependent ran- 
dom variables while this paper uses results on sums of independant 
random variabes, as it will be seen shortly. We consequently consider 
a special handling of this case in a dinstinct paper. 



Our achievement is the full description of the asymptotic behavior of 
T n (r), under the hypothesis F G D{G 1 ), < 7 < +00, for < r < 1/2, 
which is still Gaussian for r = 1/2 and non Gaussian for r < 1/2. In 
the latter case, the limiting laws are described in connection with the 
Riemann function. Our best results concern r g]0, l/2[, in which case 
the usual conditions, (C2) is useless and and (C3) becomes very week. 
The asymptotic results become very general (see Subsection 4.2). 



Before we state our results, we need to introduce some representations 
used in the theorems and to describe the conditions we shall require, 
at the appropriate time, in the next section. In Section [3j we will state 
our results and their proofs. In Section |4j statistical illustrations of 
the results and simulation study results are provided while Section [5] 
includes the key tools of this paper, which ends by a general conclusion 
in Section [6] 

2. Representations Tools 

Throughout the paper, we use the usual representation of Yi,Y 2 , ... 
by G~ l {\ — Ui), Cr _1 (l — U2), ••• where Ui, U2, ■■■ are independent and 
uniform random variables on (0, 1) in the sense of equality in distribu- 
tion (denoted by =</) 

{Y v n>l}= d {(^(l -[/,), n> 1}, 



and hence 



{{Yi, n ,Y 2tn ,...Y n>n },n> 1 } 
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(2.1) 

= d {{G-\l - U n , n ),G-\l - U n - ljn ), ...,G-\1 - U 1>n )},n > 1} . 

In connexion with this, we also use the following Malmquist represen- 
tation (see [13], p. 336) 

(2.2) {log( l ^y,j = l,...,n}=<i{E 1 ,...,E n }, 

where Ei,...,E n are independent standard exponential random vari- 
ables. 



Now we recall the classical representations of (i/'s attracted to some 
nondegenerated extremal df. For each df F in the extremal domain, an 
appropriate representation is given for G _1 (l — u) = logF^il — u), u G 
(0,1). 

Theorem 1. We have : 

(1) Karamata's representation (KARARE) 

(a) If F G D(Gj), 7 > ; then there exist two measurable func- 
tions p(u) andb{u) ofu G (0,1) such that sup(|p(w)| , — > 
as u — > and a positive constant c so that 

(2.3) 

G~ l (l-u) =logc+log(l+p(u))-7logu+( / &(t)t _1 dt), < u < 1, 

J u 

where G _1 (m) = inf{x, G(x) > u}, < u < 1 is the generalized 
inverse of G with G' 1 ^) = G-\0+). 

(b) If F G D(G 7 ), 7 < 0, 7 > 0, then y {G) = sup{s, G(x) < 
1} < +oo and there exist two measurable functions p(u) and 
b{u) for u G (0, 1) and a positive constant c as defined in (2.ty 
such that 



(2.4) yo-G^il-u) = c(l+p(w))w~ 7 exp( j b[t)rHt), < u < 1. 

J u 

(2) Representation of de Haan (Theorem 2.4-1 in J3JJ : 
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If G G D(A), then there exist two measurable functions p(u) and 



b(u) of u G (0, 1) and a positive constant c as defined in (2.3) 
such that for 

(2.5) s(u) = c(l + p(u)) exp( f &(t)t _1 tft), < u < 1, 



we /iave /or some constant d G K, 

(2.6) G- 1 (l-u) = d-s(u)+ f sfyt^dt, < u < 1. 

It is important to remark at once that any (if F in the domain of at- 
traction is associated with a couple of functions (p, b) linked to G(x) = 
F(e x ) used in each appropriate representation. Our conditions will rely 
on them. In order to state them, we set first for < r < 1/2 

k 

(2.7) <T) = k- T jT l 3 r - 1 

3=1 

and 

k 

(2.8) ^r) = ^X;j JM . 

3=1 

From there, our conditions are the following, where A is an arbitrary 
real number greater than one. The first is 

k 

(CI) g lt n(p, X)(<r n (T)k T )~ 1 f asn^ +oo 

3=1 

for 

< Oi,„(p, A) = sup \p(u)\. 

0<u<\k/n 

The second is 

k 

(C2) o 2 , n (p, A)(o n (r)A; T ) _1 J T_1 as n +oo 

3=1 

for 

92, n {b, A) = sup \b(u) \ . 

0<u<Xk/n 

The third is 

k 

(C3) 4(p, 6, A)(cr n (r)A;' r ) _1 j T_1 -^Oasn^+oo 

3=1 
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for d n (p,b, A) = max(ji ifl (p, X),g2, n (b, X)\ogk) — )■ as n — )■ oo. 

We finally point out that all the limits in the sequel are meant as 
n — > +00 unless the contrary is specified. We are now able to state our 
results in the following section. 

3. The results 



Theorem 2. Let F G D(A) = D(G ). Then, if (C3) holds, 
(s(A;/n)(7 n (r))- 1 (r n (r) - a n {r)s{k/n)) -> JV(0, 1) 
/or r = 1/2 and 

(sCfc/n)^^))- 1 ^^) - a n (r)s(A;/n)) -> £(r), 
for < r < 1/2, and where 

00 

£(r) = C(2(l - r))-^ r a-)(^. _ 1) 

zs a centered and normed random variable and 

00 

c(s) = J2r s , s>i, 

i=l 

is i/ie Riemann function. 

Let F G D(G 7 ), < 7 < +00. T/jen i/ (CI) and (C2) JioM, 

(a n (r)/ff n (r))(T n (r)/a n (r) -7) -> A/"(0, 7 2 ) 
/or r = 1/2 and 

(a n (T)/a n (r))(T n (T)/a n (r) -7) -> 7 £(r). 



Remark 1. VFe should remark that the methods used here apply for 

< r < 1. Moreover, these results do not extend to parameters r G 

1 1/2, 1 [ since for such ones, k T a n {r) — > 0, while our results are based 
on the hypothesis k T a n {r) — >■ C G]0, +00]. 

Proof. Let us use the representations ( 2.1[ ). We have, for any n > 1, 
{logX n _ j+ i jn = y n _ j+ i >n , 1 < j < n} = d {G _1 {1 - Uj tn ), l<j<n}. 
Our proofs will rely on Lemma [T] below which establishes that 

k- T J2j Tl0 S(^r^) -<t)\ K(r) = d {k-a n {r))-^r\E 3 -l) 



converges in distribution to jV(0, 1) random variable for r = 1/2 and 
to the finite random variable 



3=1 

for < t < 1/2, where the independent standard exponential ran- 



dom variables Ei,E2,... are defined in (2.2). We begin by letting 



F E D(A) = D(Gi h ) for 7 = +00. By (|247we get 
T n (r) = k- T J2i T ( s ( U 3,n) - s(U j+hn )) + k-^f / *(*)/* 

.7=1 .7=1 Ju i>» 



(It 



(3.1; 



S n (l) + S n {2). 



By using (2.5), we have for Ui jTl <v,u< U^, 



s(u)/s{v) = (1 + p(u))/(l + p(v)) exp(- / r l b{t)dt). 



By putting 
(3.2) 

Pi,n,o(p) = sup{|p(w)| ,0 < u < U k+ i, n } and 2) n,o(p) = sup{|6(w)| ,0 < ti < f/fc+l,n}, 
and by remarquing that log(£/fc + i )n /[/ lin ) = O p (logfc), we get 
s(u)/s(v) = (l + 0(pi, n ,o))exp(-Op(flf2,n,ologA;)), 

whenever 

(3.3) ^2,n,ologfc 0, 
and next 

(3.4) sup |s(u)/s(v) - 1| = O p (max(#i )nj0) #2,n,ologA;)) 

Ul,n<U,V<U k n 



and finally, 
(3.5) 



sup 

U 1>n <u,v<U k , 



s(u) — s(v) 



s{k/n) 



O p (max(g hnfi , g 2 , n ,o log k). 



We have then to prove (3.3). But since nk~ l Uk+\, n 1, for a fixed 
value A > 1 such that (C3) holds, we may find for any e > , an integer 
N such that for any n > N , 

F(g 2 , nfl \ogk > g 2 , n hgk) = P(A„) < e. 

Now fix S > 0. By using the sets decomposition 

(|</2,n,ologfe| >S) = (|</ 2 ,n,ologA;| > S) n A n + (\g 2>ri! o log k\ > 5) n A c n , 



we have, for n > Nq 

By letting first n — > oo and next e 4- 0, and by taking (C3) into account, 
we see that (3.3) holds and so does (3.5). By putting together (3.1) 
and (3.5), and by using the preceding techniques, we get for a picked 
value A > 1 like in (C3), for any e > 0, a value N such that for any 
n > N Q and for any 5 > 0, 

k 

>8)< F(d n (p, b, A) {k T a n {r))- x J^f >$) + £, 



SJ1) 



O n \T 



\s(k/n) 



3=1 



where d n (p,b,\) = max(g ln , g 2 ,n log Again, by letting first n | oo 
and next e j. 0, and by using (C3), we get that S n {l){a n {r)s{k/n))^ 1 — > P 
0. We now treat S n (2) and remark that 



S n (2) 



(fcV^r))- 1 J2f / * _1 {s(t)/s(k/n)} dt 

j=l JUj, n 



a n (r)s(k/n) 

= (k T a n (r))- 1 J2f t^dt 

j = l JUj,n 

+ (A;V n (r))- 1 Vf / {s(t)/s(k/n) - 1} /t dt = S n (2, 1)+S n (2, 2). 
We have, by (3.4) and the Malmquist representation (2.2), 

k 

\S n (2,2)\ <O p (l) C /„(p,6,A)(A;V„(r))- 1 ^^- 1 J E; j <O p {\) x 

3=1 

d n ,o(p, b, A)(Fa n (r))- 1 £ j^ 1 ^ - 1) + d nfi (p, b, A)(fcV n (r))- 1 ]T j— 1 I 

3=1 3=1 J 

where d n ,o(p, b, A) = max(gi i ^p, g a.n.o log fc). The first term tends to zero 
since d nfi (p,b,X) -> P by (3.3), and (A; T «7 n (r)) -1 Z)J =1 j 7 " -1 ^' - 1) 
converges in distribution to a finite random variable by Lemma [I] in 
Section [5j We handle the second term also by the previous techniques. 
We choose a value A > 1 like in (C3). For any e > 0, we get value No 
such that for any n > N and for any 5 > 0, 



< (p,6,A)(A;V„(r))- 1 ^r- 1 

3=1 



>5)< £+1 



rf n (p,6,A)(A;V n (r))- 1 ^r- 1 



>*). 
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By letting first n ~\ oo and next e 4- 0, we see by (C3) that this term 
goes to in probability. Finally, by the Malmquist representation (2.2), 
one arrives at 

k 

S n (2, 1) = (fcX(r))- 1 Y^f^Ej + o P (l). 
And this leads to 

k 

S n (2, 1) - a n (a)/a n (r) = (fcX(r))- 1 ^ j^ 1 ^ - 1) = V n (r) 

3=1 

and finally, by summing up what precedes, 

{a n {r)s{k/n))-\T n {r) - a n (r)) = V n (r) + o P (l). 

This concludes the proof since, by Lemma [l] in Section |5j V n (r) con- 
verges in probability to a A/"(0, 1) random variable for r = 1/2 and to 
£(r) for r < 1/2. 

Now let F G D(G y ), < 7 < +00. We will have similar proofs but 
we do not explicit the convergences in distribution as we did in the 
previous case. We have by (2.3) and the usual representations, 

k 

T n (r) = k-^3 T {log(l +p(tf i+1 , n )) - log(l +p(U j , n ))} 
3=1 

+^k- r J23 Tl °S( U J+l,n/U j ,n) + k~ T Y,r T / Kt)/t dt 

3=1 3=1 Ju i- n 

= S n (l)+S n (2)+S n {3). 
We have, for large values of k, 

k 

\S n (l)/a n (r)\ < 2g 1>n M(a n (T)k T r 1 J2j T > 

3=1 



where <7i, n ,o is defined in (3.2), which tends to zero in probability by 
(CI) and Q. Next 

k 

\S n (3)/a n (r)\ < g 2tnfl {b){a n {r)k T )- 1 Y,f^g{U J+hn /U hn ) 

3=1 

k k 

= 92,n,o(b)Mr)k T )- 1 Y,f~\ E i - 1) + 92,nfi{b)^n{r)¥)- 1 Y,r\ 

3=1 3=1 
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where g2, n ,o defined in (3.2). Then S n (3)/a n (r) — > by (C2), Lemma 
[T] and the methods described above. Finally, always by Lemma [TJ we 
get 

k 

{(S n (3) - 7 a n (r)} /a n (r) = TKW^T^r 1 ^ " 1) 

3=1 

and next 

{T n (r) - 7 a n (r)} K(r) = 7 (<7 B (r)*T 1 - 1) + o P (l) 



= jV n {r) + o P (l) 

and this converges in distribution to jV(0, 7 2 ) random variable for r = 
1/2 and to 7 £(r) for r < 1/2. The proof is completed with the remark 
that a„(r) — > r _1 . □ 



4. Remarks, Applications and Simulations 

4.1. An asymptotically Gaussian estimator. Now let us illustrate 
some statistical applications of our results. Let F 6 D{G 1 ). Recall 
that for t > 1/2, rT n (r) has been proved to be asymptotic normal 
estimators of 7 . Our present results extend those obtained to r = 1/2 
and show that this latter value is a critical one since rT n (r) is still an 
estimator of 7 for r < 1/2 but has a non-Gaussian asymptotic behavior. 
Now let us particularize the results for each case and illustrate the 
conditions (CI) and (C2). For r = 1/2, we have under the (CI) and 
(C2), 

2* 1 / a (log fc)- 1/2 (T„(l/2)/a n (l/2) - 7 ) ^ N(0, 7 2 ), 

Thus T n (l/2)/a n (l/2) is a new estimator of the extreme value index 
which is asymptotically normal. Since a n (l/2) — > 1/2, we rediscover 
the general result rT n (r) ->■ 7 . By (S4) below, a n (l/2) = (1/2)(1 + 
0(k~ 1 ^ 2 )) so that we also have 

2A; 1 / 2 (log A;)- 1 / 2 (2T„(l/2) - 7 ) ^ N(0, 7 2 ). 

From there we may form another family of estimators T„,(l/2,a) = 
(2a + (1 — a)/a n (l/2))T„(l/2), for < a < 1, which are asymptotically 
normal since 



(4.1) 



2fc 1 /2(i g k)- l l\T n {\/2, a) - 7 ) -> iV(0, 7 2 ). 
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This parameter a, that may be randomly picked from (0,1), may be 
usefull in applications. The condition (CI) turns to 

k 

gi , n (P,mogkr 1/2 Y,3 1/2 ~ 9i,n(p, A)A;" 1/2 (log k)~ 1/2 0. 

But in many real situations, we may suppose that the df admits a 
derivative so that we may take p(u) = 0, which renders (CI) useless. 
Indeed, suppose that F(x) admits a derivative F'(x) for large values of 
x. A condition for F to belong to _D(Gi/ 7 ), < 7 < +00, is 

(4.2) hm 



x— >oo 



xF'x) 7 

(see [2], p. 17, Theorem 1.1.11). This implies that 

b(u) = - u))' + 7 = uG\G-\l - u)Y l + 7 ^0asn-^oo 

and next, for some constant c > and Mo G (0, 1), 

ruo 

-u) = c-7logw+ / b{t)t' l dt. 

J u 

For such df's, (CI) is effectively useless. The condition (C2) is more 
consistent. However, it is somewhat related to the extreme second 
order condition (see [2], p. 43) and some popular models used in the 
literature. For example, in the Hall one, that is l-F(x) = C 1 x- 1 ^(l + 
C 2 x~ 13 + o(x _/3 )), where (3, Ci and C 2 are positive constants, we have 
for constants C h i = l,2,..., - u) = C^'(l + C 4 u^ + o{u^)) 

and then 

G-\l -u) = logC 3 - 7 logM + 0(M /37 ), 
which may be written, with b\(t) = w^ 7 and Ci(u) = 0(1) as u — > 0, 

G-\l-u) = C 4 -jlogu + c 1 {u) [ [b x {t)/t)dt. 



Suppose now that c\{u) = D or that c\{u) L{b\{t)jt) dt is obtained 
from J 1 c(t)(bi(t)/t) dt by the Integral Mean Theorem with a bounded 
function c(-). Then condition (C2) turns to 

fc^ + v 2 (iogjfe)-V7 n ^_> 

and (C3) to 

We conclude that for r = 1/2, T n (l/2)/a n (l/2) is an asymptotic normal 
estimator of 7 under reasonable assumptions based on the regularity 
on the df F. Simulation studies suggest its good performances as well. 
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4.2. A class of non asymptotically Gaussian estimators. For 

< r < 1/2, we also have under (CI) and (C2), 

(4-3) JT (T n (r)/a n (r) - j) £(r). 

rC(2(l - rj) 

Here again, we get a family of estimators of 7. However, the limiting 
law is not Gaussian. This seems to be new. Remark here that we 
cannot simplify (4.3) to 

k T ~< 

-( r T n (r)- 7 )^£(r) 



rC(2(l-r)) 

like for r = 1/2 because of a n {r) being finite and because of (53) below. 
This results make a connexion of Extreme Value Theory and Num- 
ber Theory since they deeply depend of the Riemann function closely 
related to the prime numbers since, for s > 1, 

as) = f[(i - P -r 1 = —7 + - [°° x -=J^d X , 

- LJ - s — 1 S /1 x s+l 

p=2 J 1 

where the product extends to the prime numbers p > 2 and [x] denotes 
the integer part of x. The limiting law C(r) is characterized by its 
characteristic function 

°° (jf\n 

^oo(t) = exp(^ i^-C(n(l - r))C(2(l - r))^' 2 ), 

n=2 



calculated and justified in (J5TJ). This random variable has all its 
moments finite and is related to the Riemann function. Indeed put 
ipoo{t) = exp(0(t)) and let 4>^ denote the r th derivative function of (f>. 
We easily see that 

00 

V®(*) = (1) Wexp(0(t)), with = ^^(zt)- 1 C(n(l-r))C(2(l-r)) 

n=2 

and 

^(t) = (0( 1 )(t)) 2 + 0( 2 )(t))exp(0(t)), 

with 

00 

^ (2) (*) = E - IJ^r'CKl - r))C(2(l - r))-/ 2 ). 

n=2 

So 

0«(O) = and 0( 2 )(O) = - [(n - l)(zt)- 2 C(n(l - r))C(2(l - r))^ 2 )] n= 
and thus 

EX = 0,EX 2 = 1. 
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One may repeat the same methods to get 

EX 3 = 2C(3(l-r))/C(2(l-r)) 3 / 2 ,EX 4 = 3 + 6C(4(l -r))/C(2(l -r)) 2 

and higher moments. For such values of r, the condition (C2) and (C3) 
become 

02,»(p, A) -»■ 

and 

max(^i )n (p, \),g 2 ,n(b, A)) ->■ 0. 

We get here the interesting thing that (C2) always holds for 
any df F E D(G 7 ),0 < 7 < +00. And this is a quite general 
and rare result in EVT. For 7 = 0, the condition (C3) is also 
very weak since g ln and g 2t „, are only requested to tend to zero 
faster that log k. We get then very general asymptotic laws 
that sound as a compensation of the lack of normality! 



Since these limits seem new in the extreme value index estimation, 
it may be reasonable that we give some comments on how to use 
them. Although we cannot give a simple expression of the df of £(t), 
Fc(t){u) = P(£(r) < it), we may use computer-based methods to com- 
pute approximations of its values. We may fix a great value of k and 
then consider a large sample, of size B, of the random variable 

k 

(4.4) C(r,k) = C(2(l - r) 1 m)- 1 ^r\E i - 1) 

i=i 

where ((s,m) = X]j=iJ~ S 5 with a sufficiently large value of m. We 
surely get the empirical distribution function (edf) of this sample as a 
fair uniform approximation of F^ T y Fix m = 10000, B = 10000, we 
may observe in the right graph in Figure [3j that the d/'s of -Fl(t) for 
t e {0.1, 0.2, 0.25, 0.3, 0.25, 0.3, 0.35, 0.40, 0.48, 0.48} are very close one 
another. When we consider a r uniformly drawn from ]0, l/2[, we ob- 
tain a significantly different df as illustrated in the left graph in Figure 
3, where we simply add this last df to those already in the first graph. 

From there, we are able to compute the probabilities and the quantiles 
of such limiting laws, as illustrated in the Table [T] in the Appendix. 
Statistical tests may then be based on these results. 
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4.3. Simulations. We wish to show the performance of these class 
of statistics as estimators of 7 and compare them to some other ones 
available in the literature, such as Hill's one T n (l) (see [8]), the Pickands 
estimator (see [TT] ) 

Pn(k) = (log 2) 1 log{(X n _fc, n — X n ^2k,n) / '(-X"n-2fc,n ~ -^n-4fc,n)} , 

and the Lo estimator (see L n (k) with 



L 2 n (k) = k 1 2_^2_^jg(iJ)(\ogX n _ j+hn - log X n ^ n ) (log X n _ i+1>n - logX n _ 
j=i i=j 

where g(i,j) = 1/2 for % = j and g(i,j) = 1 otherwise. We may see 
that the performance of these estimators largely depend on the nature 
of the tail 1 — F. In the simple case where 1 — F(x) = x~ 1//7 , all these 
estimators perform pretty well for n = 300, k ~ n ' 75 for 7 = 0.5 
and T n (l/2)/a n (l/2) has particularly good results. They also present 
good performance for the model 1 — F(x) = x -1 / 7 (l + 0(x~ v )) with 
77 = 100. The results are less sound for small 77 (77 = 10 for example) 
for sample sizes around 300. We check the performances in Table [2] 
in the Appendix, where we report the estimated values for 7 for the 
sample case and the MSE, and in Figure [T] all in the Appendix. The 



asymptotic normality of (|4.1|) is illustrated in the graphs of Figure |2| as 
good. 



The simulation results for the asymptotic normality of (4.1) are also 



acceptable since the empirical p-value is high, of order 90%. 



For values r g]0, l/2[, we illustrate the results for r = 0.3. In the graph 
in right in Figure |i| we realize that T n (0.3)/a n (0.3) is a fair estimate 
of 7 in the simple model described above. 



As said previously Figure 3 successfully compares the law of - 
7) in (Ob and that of gjb. 



C(2(1-t)) 



(T n (r)/a n (T) 



5. Lemmas and Tools 

We begin by this simple lemma where we suppose that we are given 
a sequence of independent and uniformly distributed random variables 
U U U 2} ... as in (JO). 
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Lemma 1. Let 

K(r) = ^x:riog(%^). 

Then for r = 1/2, 

cr- l (r)(V n (r)-a n (r))^Af(0,l) 
and for < r < 1 /2, 



where a n (r) and a n (r) are defined in \2. 7D and and 

oo 

£(r) = A(2(l-r))- 1 / 2 ^r (1 - T) (^-l), 
is a centred and reduced random variable having all moments finite. 



Proof. By using the Malmquist representation (2.2), we have 

V n (k) = k- T Y J 3 T ~ l E 3 . 
It follows that E(V n (k)) = a n {k) and ¥ar(V n (k)) = a 2 n (k). Put 

V*{t) = a n (k)-\V n (k) - a n (k)). 

Then 

k 

V:(r) = (k^ n (r))-'J2f- 1 (E ] -l). 

3=1 

For < r < 1/2, we have by (51) below, k T a n (r) -»■ A(2(l - r))" 1 / 2 
and 

oo 

KT(r) C(2(l - r))- 1 / 2 ^^(JB,- - 1) = L(r). 

3=1 

Now, we have to prove that C(t) is a well-defined random variable with 
all finite moments. The characteristic function of V* (r) is 

k k 

^ (r) (t) = ex P (-C(2(l-r))- 1 / 2 X)^ 1 ^)) n(l-itr (l - T) C(2(l-r))- 1 / 2 )- 1 . 

By using the development of log(l — •) and by the Lebesgues Theorem, 
one readily proves that 

^ {T) (t) = exp(X:f; ^i-(^)C(2(l - r))-^) 

j=l n=2 



(5.1) -> VooW = exp(f^ ^C("(l - r))C(2(l - r))-/ 2 ). 



n 

n=2 

Recall as in [TJj, p. 506, that for s > 1, 

s 1 ri-y 

C s = 7 + " / ~^rdx, 

S — 1 S J! X s+i 

where [x] denotes the integer part of x. This leads to 

s(s-l)- 1 <C(s)<s(l + (s-l)- 1 ). 

By using this, we see that the absolute value of the general term of the 
series in (5.1) is dominated, for large values of n, as follows 

\t\ 



! %l-r))((2(l-r))-™/ 2 



n 



< 2(1 - 2r)( 



2(1 -r) 



This shows that ipoo(t) is well defined and characterizes the df of C(t). 
In Section |i| we anticipated by remarking that C(t) is centered and 
has all its moment finite and indicated the way to get them. 

Its remains to handle the case r — 1/2. Let us evaluate the moment 
generating function of V* (1/2) : 

k 

VV n *(l/2)(*) = IlM-llrt^))" 1 ). 

3=1 

But we have cx 2 (l/2) = S(k, 1) ~ k~Hogk and next k 1 / 2 a n (l/2) = 
(log A;) 1 / 2 oo. It follows that for a fixed, for k large enough, 

(5.2) \t i- 1/2 (^/^(l^))- 1 ) < 2t(logA;)- 1 / 2 

uniformly in % > 1. At this step, we use the expansion of ^(fi -i) m the 
neighbourhood of zero : 

= 1+m 2 /2 + A(m), 
where there exists u such that 

< u < u =^> \g(u)\ < 1. 



By using the uniform bound in (5.2 ), for a value of k such that 2t(log k) l l 2 
u , we get 

V^-i,(f r 1/2 (^(l/^)- 1 ) = l + ±(f r 1/2 (^(i/^r 1 ) 2 

+(tj- 1 / 2 (/c 1/2 cr n (l/2)) _1 ) 4 5 0iJ> (t), 
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where \go,j,n(t)\ < 1 for all 1 < j < k. By the uniform boundedness of 
the error term, we have 

log^-i)(tr 1/2 {k 1 ' 2 a n {l/2))- 1 ) = 

\{tr 1/2 (k^a n (l/2)rr + (tr 1/2 (^/V n (l/2))- 1 )%,, n (t) 

{tj- 1 ' 2 (k^iimr^g^t), 
where always \gi : j :Jl (t)\ < 1 for all 1 < j < k. Finally 

k 

Vv„*(i/2)(0 = exp(^log^._ 1) (tr- 1 (A; 1 /V n (l/2))- 1 ) 

3=1 

k 

= exp(t 2 /2 + t 4 g 2 Y,r 2 ), 

3=1 

where for large values of k, 

|s 2 |<(logA;)- 2 . 

Hence 

^v*(i/2)(t) exp(t 2 /2) 

and 

V:(l/2)^Af(0,l), 
which achieves the proofs. □ 



Lemma 2. Define for r > 0, S(k, r) = £r r £* =1 jT r . T/jen 
/or t > 1, 

(51) S(A;,T)~C(r)fc- T , 
/or r = 1/2, 

(52) 5(fc,r)=2(l + 0(fc-^)), 
for t < 1, 

(53) S(k, t) = fc 1 " 2T /(l - r)(l + CKAT 1 ^)) 
and /or r = 1, 

(54) S(A:,t) = (log Jfe) A;" 1 (1 + 0(1/ log A;)). 

Proof. Theses formulas are readily obtained by comparing the X^Li J T 
and J"* x~ T dx, as in classical analysis. □ 
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6. Conclusion 

The family {T n (r),0 < r < 1/2} is only studied for F belonging to 
D(G 7 ), 7 > 0. This includes the negative case under the appropriate 
transform. However the remaining case, that is the Weibull domain, 
presents a radically different approach including sums of dependent 
random variables and deserves a seperated study. Furthermore, we 
still have to develop a Bayesian approach by considering a random 
choice of the parameter r leading to an optimal choice among a large 
class of admissible laws. 
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APPENDIX. We give in this appendix all the tables and figures of 
the paper. 





0.1 


0.15 


0.2 


0.25 


0.3 


0.35 


0.4 


0.45 


0.48 


unif 


F(-1.96) 


0.004 


0.0035 


0.006 


0.007 


0.009 


0.0137 


0.016 


0.0166 


0.018 


0.067 


F(0) 


0.5572 


0.557 


0.549 


0.544 


0.538 


0.54 


0.524 


0.501 


0.524 


0.537 


F(1.96) 


0.9530 


0.9558 


0.9545 


0.9623 


0.961 


0.965 


0.971 


0.9842 


0.966 


0.913 



Table 1. 





IS (1/2) 


Hill Est. 


Pickands Est. 


Lo Est. 


Estimated 7 


0.5047 


0.5008 


0,4992 


0.4733 


MSE 


0.0051 


0.0025 


0.045 


0.009 



Table 2. 



Figure 1. 
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200 600 1000 
vs Hilfs Estimaor 
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1 1 1 1 r 

200 600 1000 

vs Pickands' Estimator 



L 



1 — 1 — 1 — 1 — r 
20D eoo 1000 

vs Lr/s Estimator 



Figure 2. 

ecdf ecdf vs. N(0,1) empirical density 
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Figure 3. 



ecdf of L(fau) for fixed tau's 



tau fixed vs random fan 



CO 
D 



O 
O 





-2 2 4 6 



Figure 4. 
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200 600 1000 
Tn(0.3)vs Hill's Estimator 
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Estimated vs true law 
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